Rank in Wordlist | Frequency | Word |
---|---|---|
7303 | 74 | 1,5 |
7471 | 72 | 2,5 |
10110 | 50 | 4,5 |
10778 | 46 | 0,2% |
10779 | 46 | 0,4% |
10780 | 46 | 1,3 |
11161 | 44 | 3,5 |
11571 | 42 | 0,1% |
11572 | 42 | 0,5% |
11795 | 41 | 0,8% |
Rank in Wordlist | Frequency | Word |
---|---|---|
58775 | 4 | Mattinonline(IN)TOLLERANZA |
67135 | 3 | A(H1N1 |
77528 | 3 | italiana(periodo |
79869 | 3 | request(s |
84532 | 2 | A-Z(Questa |
87120 | 2 | Comment(s |
87396 | 2 | Crivelli(Travanti |
89423 | 2 | Grigioni(LE |
89806 | 2 | I63IMP6A(W)/FR |
89807 | 2 | I6I6C6A(W)/FR |
Rank in Wordlist | Frequency | Word |
---|---|---|
12246 | 39 | %) |
16021 | 28 | meteo)Impressum |
18711 | 22 | 1914-2014)» |
19985 | 20 | %). |
21509 | 18 | CHF)Articoli |
25801 | 14 | PAR)lampade |
28391 | 12 | 1965-2015)Archivi |
29490 | 12 | invio)Vai |
35168 | 9 | anni)Esploratori |
35169 | 9 | anni)Pionieri |
Rank in Wordlist | Frequency | Word |
---|---|---|
2568 | 242 | 10% |
3018 | 203 | 50% |
3265 | 188 | 20% |
3515 | 173 | 100% |
3786 | 159 | 5% |
4341 | 136 | 2% |
4815 | 121 | 40% |
5075 | 114 | 60% |
5145 | 112 | 30% |
5563 | 102 | 25% |
Rank in Wordlist | Frequency | Word |
---|---|---|
11630 | 42 | S&P |
27122 | 13 | M&G |
28829 | 12 | S&L |
30131 | 11 | B&B |
30318 | 11 | H&M |
37912 | 8 | Ticino&Lavoro |
50386 | 5 | G&B |
56353 | 4 | 12&U |
58674 | 4 | M&A |
59349 | 4 | R&B |
Rank in Wordlist | Frequency | Word |
---|---|---|
8315 | 63 | $inistra |
11184 | 44 | P$ |
19984 | 20 | $torici |
22545 | 17 | P$$ |
26826 | 13 | $ocialista |
36826 | 8 | $inistri |
43554 | 7 | ro$$i |
44092 | 6 | $inistruccia |
45600 | 6 | R$I |
49248 | 5 | $ocialisti |
Rank in Wordlist | Frequency | Word |
---|---|---|
523 | 1079 | c'è |
1579 | 400 | quest'anno |
1581 | 399 | l'anno |
1648 | 381 | dell'anno |
1726 | 363 | nell'ambito |
1870 | 336 | all'interno |
1919 | 327 | all'estero |
2115 | 298 | l'altro |
2161 | 292 | un'altra |
2586 | 240 | dell'economia |
Rank in Wordlist | Frequency | Word |
---|---|---|
2912 | 211 | https://www |
4185 | 142 | e/o |
5609 | 101 | Ticinohttp://www |
5961 | 94 | km/h |
8726 | 60 | https://t |
9776 | 52 | 2/3 |
13580 | 34 | 2016/2017 |
13884 | 33 | 2015/2016 |
16784 | 26 | categoria/Animali |
18714 | 22 | 2014/15 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots